Minimum Sample Risk Methods for Language Modeling1

نویسندگان

  • Jianfeng Gao
  • Hao Yu
  • Wei Yuan
  • Peng Xu
  • John Hopkins
چکیده

This paper proposes a new discriminative training method, called minimum sample risk (MSR), of estimating parameters of language models for text input. While most existing discriminative training methods use a loss function that can be optimized easily but approaches only approximately to the objective of minimum error rate, MSR minimizes the training error directly using a heuristic training procedure. Evaluations on the task of Japanese text input show that MSR can handle a large number of features and training samples; it significantly outperforms a regular trigram model trained using maximum likelihood estimation, and it also outperforms the two widely applied discriminative methods, the boosting and the perceptron algorithms, by a small but statistically significant margin.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

On the Design of Feedback Controllers for a Convecting Fluid Flow via Reduced Order Modeling1

Fluid Flow via Reduced Order Modeling1 John A. Burns, Belinda B. King Center for Optimal Design and Control Interdisciplinary Center for Applied Mathematics Virginia Polytechnic Institute and State University Blacksburg, VA 24061{0531 Diana Rubio Center for Research in Scienti c Computation North Carolina State University Raleigh, NC 27695{8205 Abstract In this paper, we study the e ect of mode...

متن کامل

Minimum Sample Risk Methods for Language Modeling

This paper proposes a new discriminative training method, called minimum sample risk (MSR), of estimating parameters of language models for text input. While most existing discriminative training methods use a loss function that can be optimized easily but approaches only approximately to the objective of minimum error rate, MSR minimizes the training error directly using a heuristic training p...

متن کامل

Assessment and treatment of childhood apraxia of speech: An inquiry into knowledge and experience of speech-language pathologists

Objectives: The present research aimed to identify the assessment and treatment processes implemented by Iranian speech-language pathologists (SLPs) for CAS and to investigate the possibility of impact of their knowledge level and years of experience on their choice of assessment and treatment. Methods: A cross-sectional method using survey design was employed to obtain a sample of 260 SLPs w...

متن کامل

Financial Engineering Estimation of Minimum Risk Hedge Ratio

In this paper, the financial engineering minimum risk-based portfolio hedging model is first analyzed. It is then followed by the investigation on various major estimation methods for the minimum risk hedge ratio. The results revealed in the current study show that the HR obtained by the ordinary least squares (OLS) model is maximal and the out-of-sample hedging performance is the best; however...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005